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ON AN ALGORITHM TO DECIDE WHETHER A FREE 
GROUP IS A FREE FACTOR OF ANOTHER* 

Pedro V. Silva 1 and Pascal Weil 2 

Abstract. We revisit the problem of deciding whether a finitely gen- 
erated subgroup H is a free factor of a given free group F. Known 
algorithms solve this problem in time polynomial in the sum of the 
lengths of the generators of H and exponential in the rank of F. We 
show that the latter dependency can be made exponential in the rank 
difference rank(F) — rank(_ff), which often makes a significant change. 

Resume. Nous revenons sur la question de decider si un sous-groupe 
liniment engendre H est facteur libre d'un groupe libre donne F. On 
trouve dans la litterature des algorithmes qui resolvent ce probleme en 
temps polynomial en la somme des longueurs des generateurs de H , et 
exponentiel en le rang de F. Nous montrons que Ton peut remplacer la 
dependance exponentielle en rank(F) par une dependance exponentielle 
en la difference rank(F) — rank(ff), ce qui change souvent les choses de 
fa§on considerable. 
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The combinatorial aspects of group theory have attracted the attention of the- 
oretical computer scientists for a long time, and for a variety of reasons. There 
is no need to recall the importance of the concept of monoid (e.g. free, finite) 
in the theory of automata since the foundational results of Schiitzenberger and 
Eilcnberg (see the books [1, 18]), and groups form a special case of monoids that 
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sometimes play an important role in purely language- and monoid-theoretic ques- 
tions (e.g. the type II conjecture, see [6] for a survey). Algorithmic questions 
(the word problem, the conjugacy problem, etc.) have been very influential in 
group theory throughout the 20th century, starting from the work of Dchn, and 
specialists of combinatorics on words find a particular interest in the challenges 
posed by the analogous combinatorics of the free group. Recent work on impor- 
tant non-commutative groups like the Thompson group and the so-called automata 
groups strongly relies on the formalism of finite state automata (see [3,7,22] for 
recent examples). It is already a classical result that these automata-theoretic 
and combinatorial points of view converge in the (admittedly simpler) study of 
the subgroups of free groups, this is central in the algorithmic problem tackled 
here and is discussed in detail in the first part of this paper. 

Let us also mention another reason for the recent multiplication of research 
projects on the boundary between computer science and combinatorial group the- 
ory. Public-key cryptography relies heavily on group theory: sometimes finite 
groups such as the groups of units in modular arithmetic, or the groups of ratio- 
nal points on elliptic curves over finite fields, sometimes infinite non-commutative 
groups like the braid groups (see for instance [2,4,21] and many others). At 
any rate, the design of more robust cryptographic schemes and the attack of such 
schemes rely on a deeper understanding of the combinatorial and algorithmic prop- 
erties of non-commutative groups. 

As mentioned above, the combinatorial and algorithmic problems concerning 
free groups are of special interest. Free groups are archetypal groups, whose struc- 
ture is far from being totally elucidated, and the efficient solution of standard 
problems in their context can shed some light on the possible solution of the same 
problems in more complex groups. Moreover, the solution of algorithmic prob- 
lems in free groups may be more attainable since we can use the resources of 
combinatorics on words and automata theory. 

We now present the specific algorithmic problem addressed in this paper. For 
the classical facts about free groups recorded below without a reference, we refer 
the reader to the book by Lyndon and Schupp [12]. It is well-known that the 
minimal sets of generators, or bases, of a free group F all have the same cardinality, 
called the rank of F. Moreover, if F has finite rank r, every r-element generating 
set of F is a basis, see [12, Prop. 1.3.5]. In this paper, we consider only finite rank 
free groups. 

Let H be a subgroup of a free group F, written H < F. Then H itself is a free 
group whose rank may be greater than the rank of F. We say that H is a free 
factor of F, written H <ff F, if there exist bases B of H and A of F such that 
B C A (free factors can be defined in all groups by a universal property, but the 
operational definition given here is sufficient for the purpose of this study). It is 
well known that one can decide whether a given finite rank subgroup H < F is 
a free factor of F, but the known algorithms have a rather high time complexity. 
More precisely, the best of these algorithms require time that is polynomial in the 
size of H and exponential in the rank of F, see Section [TT31 below for the details. 
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Here, the size of H is taken to be the sum of the lengths of a finite set of generators 
of H in F. 

We propose a new algorithm to decide whether a given finitely generated sub- 
group H is a free factor of the free group F, which is polynomial in the size of H 
and exponential in the rank difference between F and H . In many instances, this 
represents a substantial advantage over exponential dependency in the rank of F. 

Our algorithm relies essentially on a careful analysis of the construction of the 
graph representation of H. More precisely, once a basis A of the ambient free group 
F is fixed, there is a natural and elegant representation of the finitely generated 
subgroups of F by A-labeled graphs (or inverse automata). This construction — 
a graphical representation of ideas that go back to the early part of the twentieth 
century [19, Chap. 11] — was made explicit by Serre [20] and Stallings [23]. It has 
been used to great profit by many authors since the late 1970s, see [8, 13, 14,26] 
for recent examples. Given a finite set of generators of H (as reduced words 
over the alphabet A U A" 1 ), the graph representation of H can be effectively 
constructed (see [23], [14], etc). The number of vertices and edges of this graph 
is bounded above by £, the sum of the lengths of a set of generators of H, and 
the whole representation can be computed in time at most 0(£ 2 ) (in fact, in time 
0{l\og* I) according to a recent announcement^] by Touikan [25]). We discuss this 
representation in more detail in Section 11.21 below, and we show in Sections [2] and 
[3] how to use it to decide more efficiently the free factor relation. 

It is interesting to note that our algorithm is the first to be expressed entirely 
in terms of the graph representation of H . Let us also emphasize that we do not 
claim that our algorithm is optimal. It is an open question whether one can decide 
the free factor relation H <ff K in time polynomial both in the size of H and in 
the size, or the rank of K . 

1. Background 

If A is a basis of a free group F, we often write F = F(A) and we represent the 
elements of F as reduced words over the alphabet A. More precisely, we consider 
the set of all words on the symmetrized alphabet A — A U A^ 1 , where A^ 1 = 
{a -1 | a G A} is a set that is disjoint from A, equipped with an explicit bijection 
with A, namely a i— » a -1 . It is customary to extend the mapping u t— > u^ 1 to all 
words u G A* by letting (a -1 ) -1 = a for each a £ A, l^ 1 = 1 (where 1 denotes the 
empty word) and (aida ■ ■ • a n ) _1 = a^ 1 • ■ ■ a^ 1 * 2 ! -1 f° r all ai, . . . , a n G A A word 
in A* is reduced if it contains no factor of the form aa^ 1 or a _1 a with a G A, and 
it is well known that F can be identified with the set of reduced words over A. 
We denote by p the map that assigns to each word u the corresponding reduced 
word up G F(A), obtained by iteratively deleting all factors of the form aa^ 1 or 
a~ 1 a (a G A). 



For a positive integer n, log*(n) is the least integer k such that the fc-th iterate of the log 
function of n is at most 1. The growth of log*(ra) is so slow that it can be considered a constant 
for all practical purposes. . . 
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1.1. On inverse automata 

We describe the main tool for the representation of subgroups of free groups 
in terms of automata (see [17]). Readers less familiar with this terminology may 
think of automata as edge-labeled directed graphs. 

An automaton on alphabet A is a triple of the form A = (Q,qo,E) where Q is 
a finite set called the state set, qo G Q is the initial state, and E C Q x A x Q 
is the set of edges, or transitions. A transition (p, a, q) is said to be from state p, 
to state q, with label a. The label of a path in A (a finite sequence of consecutive 
transitions) is the sequence of the labels of its transitions, a word on alphabet A, 
that is, an element of the free monoid A* . We write p — q if there is a path from 
state p to state q with label u. The language accepted by A is the set L(A) of all 
words in A* which label a path in A from qo to go- 

This definition of automata leads naturally to the definition of a homomorphism 
ip from an automaton A = (Q, qo,E) to an automaton A' = (Q 1 , q' , E') (over the 
same alphabet A): ip is a mapping from Q to Q' such that <p(qo) = q'o, and such 
that whenever (p, a, q) G E, we also have (</?(p), a, </?(<?)) G £". The homomorphism 
ip is an isomorphism if it is a bijection and if p^ 1 is also a homomorphism. 

The automaton .4 is called deterministic if no two distinct edges with the same 
initial state bear the same label, that is, 

(P, a, q), {p, a, q') G E => q = q' . 

The automaton is called trim if every state q G Q lies in some path from qo to go- 
In the sequel, we consider automata where the alphabet is symmetrized, that 
is, the alphabet is of the form A = A U A^ 1 . We say that A is dual if for each 
a G A, there is an a-labeled edge from state p to state q if and only if there is an 
a _1 -labeled edge from q to p, 

(P,a,q) G E (g,a _1 ,p) G E. 

Let us immediately record the following fact. 

Fact 1.1. Let A be a deterministic dual automaton. If a word u labels a path 
in A from state p to state q, then so does the corresponding reduced word up. 
Moreover L(A) is a submonoid of A* and L(A)p is a subgroup of F(A). 

Now let A = (Q,qo,E) be a trim dual automaton and let p, q G Q be states 
of A If w = a\ ■ ■ ■ a n G A* is a non-empty word, the expansion of ^4 by (p, w, q) 
is the automaton obtained from A by adding n — 1 vertices gi, . . . , (fo-i and 2n 
edges 



ai a2 «n-l 

p — ► qi — >■■■ — > q n - 



and 



q — >q n -i — > • • • — ><7i — >P- 
Note that this automaton is still trim and dual. Moreover, if p = q = qo, then we 
observe the following. 
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Proposition 1.2. Let A = (Q,qo,E) be a trim dual automaton, let H = L(A)p 
and let w be a non-empty word. If B is the expansion of A by (qo,w,qo), then 
L(B)p is the subgroup generated by H and wp, that is, L{B)p — (H,w). 

Proof. Let C be the dual automaton consisting of the state qo and the states and 
edges added to A in the expansion. It is immediate that L(C)p is the subgroup of 
F(A) generated by wp. 

If u G L(B), we can factor a path qo -^qo according to the successive visits 
of state qo. The resulting factorization of u makes it clear that u is a product of 
elements of L(A) and L(C). Thus, L(B) is the submonoid generated by L(A) U 
L(C), and L(B)p is the subgroup generated by L(A)p and wp. This concludes the 
proof. □ 

1.2. Reduced inverse automata 

The automaton A is called inverse if it is deterministic, trim and dual. It is 
reduced if every state q G Q lies in some path from qo to qo , labeled by a (possibly 
empty) reduced word. We note the following result, a cousin of [24, Thm 1.16]. 

Proposition 1.3. // A and B are reduced inverse automata such that L(A)p = 
L(B)p, then A and B are isomorphic. 

Proof. Let A — (Q,qo,E) and B = (P,po,D) be reduced inverse automata such 
that L(A)p = L(B)p. We construct an isomorphism ip between A and B as follows. 
We first let <p{qo) = Po- 

Let q G Q. Since A is reduced, there exist reduced words u and v such that the 
word uv is reduced, qo 1 an d q—^qo- Then uv G L(A)p, so uv G L(B)p, and 
hence uv G L(B) by Fact 11.11 Thus uv labels a path in B from p to po, and we 
let <p(q) be the unique state in P such that po — > ™~^Po- 

We first verify that Lp is well defined. Suppose that uv and u'v' are reduced 

words such that qo — q —> qo and qo -—f q —> qo in A. We want to show that 

if po -—^ P Po an d Po —>p' —^Po in B, then p = p' . We note that u'v labels 
a path from qo to go m A. If u'v is a reduced word, then by the same reasoning 

as above, u'v labels a path in B from po to po, say, po — ^ p" — — > Po an d the 
deterministic property of B implies that p' = p" = p. 

If u'v is not reduced, and a is the first letter of v, then the last letter of u' is 
a -1 while the last letter of u is not a -1 . Therefore u'u^ 1 is reduced, u'u^ 1 G £(.4) 

and again, there is a path in B of the form po -^>p" - — >po- By determinism, it 
follows that p' = p" = p. 

This shows that p is well defined. A dual construction yields a well-defined 
mapping ip from P to Q such that, whenever po——>p—^Po m S and w is a 
reduced word, then go 9o in A. Using the determinism of A and B, it 

is now immediate that ip o ip is the identity on Q and 99 o -0 is the identity on P. 

There remains to verify that if and <p~ l are homomorphisms. The case of 
is dual of that of Lp and we treat only the latter. That is, we want to show that if 



G 
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(q,a,q') is a transition in A, then ((/?(g), a, </?(g')) is a transition in B. Let uv and 
u'v' be reduced words such that q — > g — ^ g n an( i 9o <7o- hi particular, 

we have po — — > f(q) —^po and po — — » f(q') -~^po hi B. 




v J 



If uav' is reduced, then in B, there is a path from po to po labeled uav 1 , and 
by determinism, there is a transition (ip(q),a,(p{q r )). If itai/ is not reduced, then 
cither ua is not reduced or av' is not reduced. If ua is not reduced, then u = 
u\a~ x and by determinism, go —^q'- As in the first part of the proof, it follows 
that at least one of u\v' and u\u' 1 is reduced, so po — l -* </?(<?') in £> and hence 
there is a transition (</?(<?), a, Lp(q')). The case where at/ is not reduced is handled 
symmetrically, and this concludes the proof. □ 

Let H be a subgroup of F(v4). Say that an automaton A on alphabet A repre- 
sents H if A is reduced and inverse and if L(A)p = H. Proposition 11.31 shows that 
there exists at most one such automaton, and we denote it by Ta(H) if it exists. 
We now discuss the existence and the construction of Ta(H) when H is finitely 
generated. (As it turns out, Ta(H) always exists, but our interest in this paper is 
restricted to the finite rank case.) 

Let A be an automaton and let p, q be distinct states of A. The automaton 
obtained from A by identifying states p and q is constructed as follows: its state 
set is Q \ {p, q} U {n}, where n is a new state; its initial state is qo (or n if p or q is 
equal to qo)', and its set of transitions is obtained from E by replacing everywhere 
p and q by n. If A is trim or dual, then so is the automaton obtained from A by 
identifying a pair of states. 

Now let A be a dual automaton. If A is not deterministic, there exist transitions 
(r, a,p) and (r, a, q) with p ^ q and a G A. Identifying p and q yields a new dual 
automaton B, and we say that B is obtained from A by an elementary reduction 
of type 1. 

Fact 1.4. Let A be a dual automaton and let B be obtained from A by an 
elementary reduction of type 1. Then L(A)p = L(B)p. 

Proof. It is easily seen that L(A) C L(B). For the converse, we use the notation 
given above: in B, the states p and q of A are replaced with a new state n. Let 
u G L(B). Then there exists a path labeled u from the initial state of B (say, go) 
to itself. If that path does not visit state n, then u also labels a path from go to 
itself in A and hence u G L(A). 
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If that path does visit state n, we consider the factorization of u given by the 
passage of that path through n: we have u = uqU\ ■ ■ • u r , r > 1 and 



It follows that in A, t^-labelled paths exist, with end states p or q (or go). Then 
one of uq and uoa~ 1 a labels a path in A from go to q. Similarly, one of u r and 
a~ 1 au r labels a path from q to qo (making due allowance if p or q is equal to qo). 
And for each 1 < i < r, one of Uj, a~ 1 au i , u i a~ 1 a and a~ 1 au i a~ 1 a labels a path 
in A from g to q. Therefore, there exists a path in A of the form qo — go such 
that up = vp, which concludes the proof. □ 

Now assume that A is a deterministic dual automaton. 

Fact 1.5. Let A be an inverse automaton. Then A is non-reduced if and only if 
there exist states q ^ qo and p, and a letter a € A such that the only transitions 
of A involving q are (p, a, q) and (q, a ,p). 

In graph-theoretic terms, this means that A is reduced if and only if no vertex 
of A has degree one (more precisely: no vertex is adjacent to a single A-labeled 
edge), except possibly go- 

Proof. By definition, A is not reduced if and only if there exists a state q that 
does not lie on any path from q to itself, labeled by a reduced word. We first 
observe that the state q cannot be equal to qo since the empty word is reduced, 
and labels a path from qo to itself. Suppose now that there exist reduced words 
u,v with distinct last letters, labeling paths from g to q: then uv~ x is a reduced 
word, labeling a path from go to itself and visiting q. On the other hand, if every 
reduced word labeling a path from go to g ends with, say, letter a € A, then every 
path from go to itself visiting q has a non-reduced label. Thus A is not reduced 
if and only if there exists a state q ^ qo and every reduced word labeling a path 
from go to g ends with the same letter. By determinism, this is equivalent to the 
existence of another state p such that the transitions involving q are (p, a, g) and 



Let A be inverse and not reduced, and let a,p,q be as in Fact 11.51 If B is 
obtained from A by omitting state q and the transitions involving it, we observe 
that B is again an inverse automaton, and we say that B is obtained from A by 
an elementary reduction of type 2. 

Fact 1.6. Let A be an inverse automaton and let B be obtained from A by an 
elementary reduction of type 2. Then L(A)p — L(B)p. 

Proof. Let a be a letter and let p, q be states of A as in Fact ll.5[ and let us assume 
that B is obtained from A by omitting state g and the transitions involving it. It is 
easily seen that L(B) C L(A). Conversely, let u e L(A). By FactOl up £ L(A). 
Now Fact 11.51 shows that the path go — > pqo in A cannot visit state g, since up is 
a reduced word. It follows that this path is also a path in B, up € L(B) and hence 




n 



n - ■ ■ n 




(q,a \p). 



□ 



up e L(B)p. 



□ 
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Let A be a trim, dual automaton, and let B be an automaton obtained by 
iteratively performing elementary reductions, first of type 1 until the automaton 
is inverse, and then of type 2 until none is possible. Then B is a reduced inverse 
automaton, we write B = Ap and we say that B is obtained from A by reduction. 
Moreover, Facts [PI and [11)1 show that L(A)p = L(B)p. 

This leads directly to the well-known algorithm to construct a reduced inverse 
automaton representing a given finitely generated subgroup H. Let h\,...,h n 
be generators of H , and let us consider the automaton obtained from the trivial 
automaton (one vertex qo, no transitions) by performing successively expansions 
by (qo,hi,qo) (1 < i < n) and then reducing the automaton. It follows from 
Propositions 11.21 and 11.31 that the resulting automaton is (H ) . Note that it 
does not matter which set of generators of H was used, nor in which order the 
elementary reductions were performed. 

Remark 1.7. This construction of Ta(H) is well known, and can be described in 
many different ways, notably in terms of immersions over the bouquet of circles 
(Stallings [23]) or of closed inverse submonoids of a free inverse monoid (Margolis 
and Meakin [13]). 

Fact 1.8. There is a well-known converse to the above construction: if A is a 
reduced inverse automaton and H = L{A)p, then H has finite rank and a basis for 
H can be computed as follows (see Stallings [23]). Given a spanning tree T of the 
(graph underlying the) automaton A, for each state p, let u p be the reduced word 
labeling a path from go to p inside the tree T. For each transition e = (p, a, q), let 
b e = UpCiUq 1 : then a basis of H consists of the elements b e , where e runs over the 
transitions e = (p, a, q) not in T and such that a £ A. 

We note that, given a finite set h\, . . . , h n of elements of F (A) with total length 
£ = J^i Mi one can construct Ta(H) in time at most 0(£ 2 ) and Ta(H) has 
v < £ — n + 1 states. Moreover, finding a basis of H can be done in time at most 
0(v 2 ) (0(wlog* v) according to Touikan's announcement [25]), and the rank of H 
is equal to e — v + 1, where e is the number of edges in Ta(H). 

1.3. On the complexity of Whitehead and other algorithms 

It is well known that one can decide, given a subgroup H of a finite rank free 
group F, whether if is a free factor of F. We briefly describe here the main known 
algorithms and discuss their complexity. 

Let H be a finitely generated subgroup of a free group F of rank r, with basis 
A. Let hi, . . . , h n be a generating set of H. By the results summarized in Fact II. 81 
up to a quadratic time computation, we may assume that hi, . . . , h n is a basis of 
H. Let i= \hi \ + ■ ■ ■ + \h n \ be the total length of the tuple (hi)i, and let d = r — n 
be the rank difference between F and H - which we assume to be positive, since 
H can be a proper free factor of F only if n < r. 

Federer and Jonsson (see [12, Prop. 1.2.26]) gave the following observation and 
decision procedure: H is a free factor of F if and only if there exist d words 
h n+ i, . . . , h r , each of length at most max{|ft,j| | 1 < % < such that h\,...,h r 
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generate the whole of F. The resulting algorithm requires testing every suitable 
d-tuple of reduced words on alphabet A. Each of these tests (does a certain r-tuple 
of words generate F?) takes time polynomial in the total length of the r-tuplc, 
and hence in d£. However, the number of tests is 0(r dl ), which is exponential in 
£ and d. 

This approach leads to the following. 

Fact 1.9. Deciding whether H <ff K is in NP, with respect to dl. 

Proof. To verify that H <ff K, we need to guess d words of length at most £, and 
verify that together with H, they generate F, which can be done in 0((d£) 2 ). □ 

Another approach is based on the use of Whitehead automorphisms. We refer 
the readers to [12, Sec. 1.4] for the definition of these automorphisms, it suffices to 
note here that the set W of Whitehead automorphisms of F which do not preserve 
length, has exponential cardinality (in terms of r). A result of Whitehead [12, 
Prop. 1.4.24] shows the following: if there exists an automorphism tp of F such 
that the total length of (tp(hi))i is strictly less than £, then there exists such an 
automorphism in W. In particular, an algorithm to compute the minimum total 
length of an automorphic image of the tuple (hi)i consists in repeatedly applying 
the following step: try every automorphism ip S W until the total length of (ip(hi))i 
is strictly less than the total length of (/ij),; if such a tp exists, replace by 
(ip(hi))i; otherwise, stop and output the total length of (hi)i. 

This applies to the decision of the free factor relation since H <ff F if and 
only if there exists an automorphism tp mapping a basis of H to a subset of A. 
Thus an algorithm consists in first computing a basis of H, and assuming that 
hi, . . . , h n is a basis, verifying whether the minimum total length of (tp(hi))i when 
tp runs over the automorphisms of F, is exactly n. This algorithm may require 
0((£ — n)card(W)) steps, each of which consists in computing the image of a tuple 
of length at most I under an automorphism, and hence has complexity 0(C). Thus 
the time complexity of this algorithm is 0(£ 2 card(W)), which is quadratic in £ 
and exponential in r. 

A variant of this algorithm was established by Gcrsten [5] , who showed that a 
similar method applies to find the minimum size (number of vertices) of Ta 
when tp runs over the automorphisms of F(A). It is clear that H is a free factor 
of F(A) if and only if there exists an automorphism tp such that T a(<p(H)) has a 
single vertex. The time complexity is computed as above, where the number of 
vertices of Ta(H) is substituted for the total length of a basis for H. As noted 
earlier, this number of vertices is usually substantially smaller than the total length 
of a basis, but the two values are linearly dependent, so the order of magnitude of 
the time complexity is not modified, notably the exponential dependence in r. 

Remark 1.10. The discussion of Whitehead's algorithm above concerns only 
the so-called easy part of the algorithm (see for instance Kapovich, Miasnikov 
and Shpilrain [9]). Results by Miasnikov and Shpihain [15], Khan [10], and most 
recently by Donghi Lee [11] on the possible polynomial complexity of the hard part 
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of the algorithm also consider the rank of the ambient free group as a constant, 
and do not question the actual exponential dependence in that parameter. 

2. A CAREFUL LOOK AT THE EXPANSIONS AND REDUCTIONS OF 

INVERSE AUTOMATA 

Let A be a reduced inverse automaton. 

Let B be obtained from A by performing an expansion, say by (p,w,q), and 
then reducing the resulting automaton. In this situation, we write A — B, 
or simply A — > e x P B. We distinguish two special cases. 

• If the reduction following the expansion does not involve identifying or omit- 
ting states of A, or equivalently if A embeds in £>, we say that B is obtained from 
A by a reduced expansion and we write A — >[l ,w ^ B or A — > re B. 

• If the states p and q are equal to the distinguished state qo of A, we say that 
B is obtained from A by an e-step and we write A — >™ B, or simply A — > e B. 

Finally, let B be obtained from A by identifying two distinct vertices p and q, 
and then reducing the resulting automaton. Then we say that B is obtained from 
A by an i-step and we write A — >f~ q £>, or simply A — >\ B. 

Note that if A — > e x P @, A — > re B, A — > e B or A — >i B, then B is a reduced 
inverse automaton. 

We first record a few facts. 

Fact 2.1. Let tibca reduced word labeling a path in A from a state p to a state 
p', and from a state q to a state q' , 

U I U I 

p — >p, q — >q. 

By definition of the reduction of dual automata, the identification of p and q 
implies that of p' and q', and the converse holds as well. Thus A — >f~ q B if and 
only if A — >f =g ' B. 

Let us now examine in detail the effect of an operation of the form — > e xp • 

Fact 2.2. Let p, q be states of A and let w be a non-empty reduced word. Let 
u be the longest prefix of w that can be read in A from state p, and let v be the 
longest suffix of w that can be read in A to state q (that is, v^ 1 is the longest 
prefix of w _1 that can be read in A from state q). We distinguish two cases: 

(1) If \u\ + \v\ < \w\, then w = uw'v for some non-empty reduced word w' . 
If we let p' (resp. q 1 ) be the end (resp. start) state of the path labeled u 
(resp. v) and starting in p (resp. ending in q), 

U / w' I V 

p — >p — >q — >q, 

then the reduction process on the result of the expansion of A by (p, w, q) 
identifies the \u\ first edges and the \v\ last edges of the added path with 
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existing edges of A, so that A — ►exp"' 9 ^ B if and only if A — >ix P ' w :9 ' B 
and the latter is a reduced expansion. 
(2) If \u\ + \v\ > \w\, then there exist words x,y,z, possibly empty, such that 
u = xy, v = yz and w = xyz. Let p',p" , q' , q" be the states of A defined 
by the following paths 

x i V ii i V ii z 

P — >P — >P , q — y q — >Q- 

Then A — yjg$ B,v) B if and only if A — >f =q ' B, if and only if A — >f" =q " 
B. 

We derive from Fact 12.21 the following statement. 

Proposition 2.3. Let A and B be inverse automata. If A — >™ B, then A — r, B 
or some states p and q and a reduced word u such that \u\ < \w\. 

The following converse statements are derived from Facts 12.11 and 12.21 

Proposition 2.4. Let Abe a reduced inverse automaton, let H = L(A)p, let u and 
v be reduced words labeling paths q qo — ^ p in A, and suppose that A — >f~ q B. 
Then A — yf B and L{B)p = (H, uv) . 

In particular, rank(L(B)p) < 1 + rank(H). If rank(L(B)p) = 1 + rank(if), then 
H <ff L(B)p and if C is a basis of H, then C U {uv} is a basis of L{B)p. 

Proof. Let A 1 be the expansion of A by (qo, uv, qo). The analysis in Fact 12.21 (2) 
shows that a step in the reduction of A' is provided by the automaton obtained 
in identifying p and q. By Proposition 11.21 we have L(B)p = (H,uv) (hence the 
rank inequality), and the uniqueness statement in Proposition ll.3l thcn shows that 
A — > e ™ B. 

Let us now assume that rank(L(B)p) = 1 + rank(_ff), and that C is a basis of 
H . Then C U {uv} is a generating set of L(B)p with cardinality equal to the rank 
of L(B)p, so C U {uv} is a basis of that subgroup by [12, Prop. 1.3.5]. □ 

Proposition 2.5. Let A and B be reduced inverse automata, let w be a reduced 

word such that A >^' w ' q ^ B, let H — L(A)p, and let u and v be reduced words 

labeling paths q — > qo ~—^p in A. Then A >'^ wv B and L(B)p = (H, uwv) . 

Moreover, H <ff L(B)p, rank(L(B)p) = 1 + rank(i/) and if C is a basis of H, 
then C U {uwv} is a basis of L(B)p. 

Proof. Since the expansion of A by (p, w, q) is a reduced expansion, the word uwv 
is reduced and the expansion by (qo,uwv,qo) falls in the situation described in 
Fact EH (1). Together with Proposition O it follows that A — >™™ B, which 
concludes the verification that of the first part of the proposition. 

The free factor and the rank statements follow from the statement on a basis 
for L(B)p, which is a direct consequence of the definition of a reduced expansion 
and of the discussion on bases and spanning trees in Fact 11.81 □ 

We now introduce a measure of the cost of a reduced expansion or an i-step 
a, written A(er): if a is an i-step, then A(cr) = 0; if a is a reduced expansion, 
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a = — s-re'™' 9 , its cost is the length of w, A(u) = \w\. We extend this notion of 
cost to finite sequences of i-steps and reduced expansions: if a = (a±, . . . ,a n ) is 
such a sequence, we let 

X(a) = (X(a 1 ),...,X(a n )). 

Finally, we introduce an order relation on the set of finite sequences of non-negative 
integers. Let k = (hi,..., k„) and 1= (£i, . . . , £ m ) be such sequences. We say that 
k ■< I if 

either n < to, 

or n = to and Yh=i kl < Y^Li _ 

or n — to, Ylf—i ki — Y^Li f-i an( i ^ precedes £ in the lexicographic order. 
It is routine to check that ^ is a well-order on the set of finite sequences of 
non-negative integers, which is stable under the concatenation of sequences. We 
write k -< £ if k ^ I and k ^ £. 

Proposition 2.6. Let A, A' and B be inverse automata such that A' is obtained 
from A by a reduced expansion 0\ and B is obtained from A' by an i-step 02, 

Then there exist a sequence of reduced expansions or i-steps a' of length 1 or 2 
such that B is obtained from A by applying the steps in a' and A(cr') -< \(o~\,o~2)- 

Proof. Suppose that A — >^ w ' q) A' — > r = s B with r + s. The cost of this 
sequence of transformations is (\w\, 0). 

Let Q be the state set of A and let u and v be reduced paths, 

V u 

q — ><?o — >p- 

Then uwv is a reduced word and L(A')p = (L(A)p,uwv) by Proposition 12.51 We 
distinguish three cases, depending whether or not r and s lie in Q. 
Case 1: Both r and s are in Q. Let x and y be reduced words labeling paths in A 
s — qo -^-> r. Then the same words label similar paths in A' and it follows from 
Proposition 12.41 that 

L(B)p= (L(A')p,xy) = (L(A)p,uwv,xy). 

Let also A" and B' be determined by A — >l= s A" — >f" v B' . Then L(B')p is 
also equal to (L(A)p, xy, uwv), so that B — B' by Proposition II. 31 

Note that the words u and v label paths from and into state go in A" as well. 
It follows from Proposition ^. 31 that, if uwv ^ L(A"), then B can be obtained from 
A" by an i-step or by a reduced expansion of the form — > r ^ :Z '' ' with \z\ < \w\. 

Thus B is obtained from A either by a sequence of 1 or 2 transformations, of 
cost or (0, k) with < k < \w\. This is -<-less than Qw\, 0), as expected. 



TITLE WILL BE SET BY THE PUBLISHER 



13 



Case 2: Exactly one of r and s is in Q. Without loss of generality, we may assume 
that r S Q and s $ Q. Let z be a reduced word labeling a path from qo to r in 
.A, and hence also in A 1 . Let 5 be the unique reduced word labeling a path from 
p to s in A' , using only edges that were not in A. By assumption, g is a proper, 
non-empty prefix of w. Moreover, by Propositions 11.21 and 12.41 

L[B)p = (L(A')p,ugz~ 1 ) = (L(A)p,uwv,ugz^ 1 ). 

Let h be the longest common suffix of g and z, so that g = g'h, z = z'h, g' z' 1 
is reduced and we have the following paths in A' , 

z' 1 h g 1 h 

qo — >r — > r, p — >s — >s. 

Fact 12.11 shows that A' — >J =s £>, so we may assume that h = 1, g = g' and 
z = z' . There is a possibility that the word g is now empty (if h was in fact equal 
to g), but in that case, we are returned to the situation of Case 1, with s' = p. 
Thus we may still assume that g ^ 1. In particular, the word ugz~ l is reduced. 

Then let A" and B' be defined by A — ^ gz ~ 1 A" — >% wv B'. Again L(B')p = 
(L(A)p,uwv,ugz~ 1 ), so B = B' by Proposition II. 31 

Proposition 12.31 states that each e-step can be replaced by an i-step or by a 
reduced expansion of cost bounded above by the cost of the e-step. Going back to 
Fact 12. 2\ we see that the e-step A — >^ gz A" can be replaced by a transformation 
of cost k < \g\ since both u and z can be read from state qo in A (in fact, of cost 
exactly \g\ by definition of g). As for the e-step A" — >^ wv B, it can be replaced 
by a transformation of cost t < \w\ — \g\ since ug (a prefix of uw) and v can be 
read to state qo in A". 

Now, it suffices to verify that (fc, £) ~< (\w\, 0), which is easily done if we observe 
that k + 1 < \w\ and k < \w\. 

Case 3: Neither r nor s is in Q. Without loss of generality, we may assume that r 
occurs before s along the in-labeled path from p to q. Thus, the word w factors as 
w — W1W2W3 and the path in A' made of edges added to A factors as 

Wi W2 W3 

p > 1 > s > q. 

Since r^s and these vertices are not in Q, each of the three factors Wi, W2, W3 is 
non-empty. Moreover, 

L(B)p— {L(A')p,uwiW3v) = (L(A)p,uwv 1 uwiW3v). 

Let h be the longest common suffix of w\ and w^ 1 , so that w% = w[h, W3 = 
h~ l w' s , w^w's is reduced and we have the following paths in A', 

w[ . h w 2 h 1 w 3 

p — ► r — ► r > s < — s > q. 

Proposition 12.11 shows that A' — >f =s B, so we may assume that h = 1, W\ = w[ 
and W3 =w' 3 . There is a possibility that the words w\ or w$ be now empty (if h 
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was in fact equal to 101 or IU3), but in that case, we are returned to the situation 
of Cases 1 or 2, with r' = p or s' = q. Thus we may still assume that w± 7^ 1 and 
W3 ^ 1. In particular, the word uwiw^v is reduced. 

Then let A" and B' be defined by A — >f>w A" — ^ wv B'. Then L{B')p = 
(L(A)p,uwv,uwiW3v), so B = B' by Proposition II. 31 

As in Case 2, we use Factl2~2"lto verify that the e-step A — ►™ lW3 * A" can be 
replaced by a reduced expansion of cost k = \wiWz\ since u and v are the maximal 
prefix and suffix of uw\Wqv that can be read from and to state qo in A. As for the 
e-step A" — >e WV B, it can be replaced by a reduced expansion of cost I = |w 2 
since uw\ and W3V are the maximal prefix and suffix of uwv that can be read from 
and to state go m A" '■ 

Now, it suffices to verify that (fc, £) ~< (\w\, 0), which is easily done if we observe 
that k + 1 = \w\ and k < \w\. □ 

3. Deciding the free factor relation 

3.1. A GEOMETRIC CHARACTERIZATION OF FREE FACTORS 

We put together the technical results from Section [2] to prove the following 
characterization of free factors. 

Theorem 3.1. Let H, K be finitely generated subgroups of F — F(A) and assume 
that d = rank(/iT) — rank(_ff) > 0. Then H is a free factor of K if and only if the 
inverse automaton T a{H) can be transformed in Ta(K) by a sequence of d! < d 
i-steps followed by d — d! reduced expansions. 

Proof. We first observe that H is a free factor of K if and only if there exist d 
elements fci, . . . , fed of ^(^4) such that (H U {fci, . . . , kd}) = K. This follows from 
the fact that an r-element generating set in a rank r free group, is a basis [12, Prop. 
1.3.5]. 

By definition of e-steps, this means that H <ff K if and only if Ta(H) yields 
Ta{K) by a sequence of d e-steps. 

Now Propositions 12.31 12.41 and 12.51 show that this is equivalent to the fact that 
Ta{H) yields Ta(K) by a sequence of d i-steps or reduced expansions. 

Since ^ is a well-order on the set of finite sequences of non-negative integers, 
we may consider a sequence a of d i-steps and reduced expansions leading from 
Ta{H) to Ta(K), which is ^-minimal. Proposition ^. 61 together with the stability 
of < under concatenation, then shows that the i-steps in a come before the reduced 
expansions. Thus, H <ff K if and only if Ta(H) yields Ta{K) by a sequence of 
d 1 i-steps followed by d — d! reduced expansions. □ 

It follows from the discussion on bases and spanning trees in Fact 11.81 that if 
Ta(H) embeds in Ta(K), then H is a free factor of K. Not every free factor of K 
occurs that way, and those that do are called the graphical free factors of K [with 
respect to A). It is easily verified that H is a graphical free factor of K if and only 
if Ta{H) yields Ta{K) by a sequence of reduced expansions. 
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Corollary 3.2. Let H, K be finitely generated subgroups of F = F(A) and assume 
that d = rank(i-i') — rank(_ff ) > 0. Then H is a free factor of K if and only if the 
inverse automaton T^(H) can be transformed by a sequence of d' < d i-steps into 
some Ta(L) such that rank(L) = rank(i/) + d' and L is a graphical free factor of 
K with respect to A. 

Proof. Let us first assume that H <ff K . By Theorem l3.H for some d' < d, Ta(H) 
can be taken to some Ta(L) by a sequence of d' i-steps, and T a{L) can be taken to 
Ta (K) by a sequence of d — d' re-steps. Since rank(if) = rank(_ff) + d and an i-step 
or an re-step can increment the rank by at most one (Propositions 12.41 and 12. 5|) , L 
must have rank exactly rank(ff) + d' . 

Conversely, suppose that a sequence of d' < d i-steps takes Ta(H) to Ta(L) 
in such a way that rank(L) = rank(i7) + d' and Ta{L) embeds in Ta{K). By 
Propositions 12.41 and 12.51 again, we have H <ff L <ff K, and hence H <ff K. □ 

Remark 3.3. We observe the following by-product of the proof of Corollary 13.21 
if Ta(H) can be transformed into Ta(L) by a sequence of d! i-steps such that 
rank(i) = 6! + rank(if), then for every i-step A — >; B occurring in that sequence, 
we have rank(L(B)p) = 1 + rank(X(„4)p). We say that such an i-step is rank- 
incrementing. 

In the special case where K — F, we have the following statement. 

Corollary 3.4. Let H be a finitely generated subgroup of F = F(A), let Aq 
be the set of letters in A that occur in V ' a{H) and let d = \A$\ — rank(iZ) = 
rank(F(j4o)) — rank(iJ). Then H is a free factor of F if and only if d > and 
Ta(H) can be transformed into a one-vertex automaton by a sequence of d i-steps. 

Proof. By Corollary 13. 2\ H is a free factor of F{A) if and only if Ta(H) yields 
a one-vertex automaton B by a sequence of d 1 < \A\ — rank(_ff) i-steps, in such a 
way that L(B)p has rank d! + rank(i7). Now the set of letters occurring in such 
an automaton B must be exactly A , so L(B)p — ^(^o) and d' = d. □ 

3.2. Deciding whether H < fF F 

We first consider the problem of deciding whether a given finitely generated 
subgroup H of F = F(A) is a free factor of F. With the notation of Corollarv l3.4l 
and Remark 13. 3[ the algorithm to decide whether H <ff F consists of the follow- 
ing. We need to explore the sequences of rank-incrementing i-steps, starting from 
Ta(H) and of length d — \A \ — rank(_ff). Then H <ff F if and only if one of 
the automata occurring at the end of one of these sequences has a single vertex. 
Note that no automaton obtained after less than d i-steps could be a single- vertex 
automaton. 

These automata can be viewed as nodes of a tree, rooted at Ta{FI), in which 
the children of a node are the automata produced by a rank-incrementing i-step. 
Let n be the number of states of Y a{FL). Then every automaton occurring along 
this tree has at most n states. 
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If A is such an automaton, then A has at most \{n 2 — n) pairs of distinct 
vertices, and hence at most \{n 2 — n) children, each of which has at most n— 1 
states. The computation of the children of A is done by computing all the (at most 
\{n 2 — n)) possible i-steps, computing the ranks of the corresponding subgroups 
and retaining only those of rank 1 + rank(L(A)p). It follows from Fact 11.81 that 
the cost of the computation of the children of A is 0(n 4 ). 

Moreover, there are at most 0(n 2d ~ 2 ) nodes of the tree at depth at most d— 1, 
and the computation of these nodes and their children requires time 0(n 2d+2 ). For 
each of the 0(n 2d ) automata at depth d, the verification whether the automaton 
has a single node takes constant time, so the total cost of the algorithm is 0(n 2d+2 ). 

Finally, if H is given by a finite set of generators, of total length £, we recall 
that computing Ta(H) takes time 0(£ 2 ) and that Ta(H) has at most £ states and 
£ edges. This discussion justifies the following statement. 

Theorem 3.5. Let A be a fixed alphabet. Then there is an algorithm which, 
given reduced words h\, . . . ,h n in F(A) with total length £, decides whether the 
subgroup H generated by the hi is a free factor of F(A) in time 0(£ 2d+2 ), where 
d = \Aq\ — rank(iJ) and Aq is the set of letters in A that occur in the hi. 

Remark 3.6. The tree exploration described above can be speeded up by the 
following observation. If the automaton A occurs in a sequence of rank-incre- 
menting i-steps from Ta(H) to the one- vertex automaton Ta{F(Aq)) (a winning 
sequence), then L{A)p <ff F(Aq), which implies that L(A)p is a free factor of 
every subgroup of F(A ) containing it. It follows that, if some i-step A — B 
does not increment the rank, then A does not occur in a winning sequence, that 
is, we may ignore the subtree below A. 

There are undoubtedly other implementation tricks and ideas that can reduce 
the decision process, however without changing the worst-case complexity. 

Remark 3.7. In view of Touikan's announcement (see Fact 11.8( 1. the time com- 
plexity in the above theorem can be lowered to 0(£ 2d+1 log* £). 

The above discussion of complexity depends on a uniform cost assumption, by 
which we assume that the elementary operations on A (reading or writing a letter, 
comparing two letters) require unit time. In other words, we have been assuming 
that A is fixed, and not part of the input. We will shortly consider the problem 
of deciding whether H <ff K where both H and K are part of the input, and for 
the purpose of that discussion, we reconsider the algorithm and the complexity 
established above under the bit cost assumption: we let r be the cardinality of A, 
and we consider that each letter is identified by a bit string of length at most logr, 
so that the elementary operations on A require O(logr) units of time. 

Let f{£,r) be the complexity of reducing a dual automaton on A with at most 
£ vertices and £ edges. The computation of the automaton obtained by an i-step 
from an £- vertex automaton takes time at most f(£,r). To compute the tree 
of rank-incrementing i-steps rooted at Ta{H), we need to compute the at most 
0(£ 2 ) children of at most £ 2d ~ 2 nodes, which requires time 0{£ 2d f{£, r)). Finally, 
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deciding whether an automaton has a single vertex takes constant time, so the 
total complexity of the algorithm is 0(£ 2d f(£, r)). 

We now give an upper bound of f(£,r). Let A be an A-automaton with at 
most I states and I edges. The identifiers of states require space 0(\og£) and the 
identifiers of letters require space O(logr). Moreover, we assume that the set of 
states and the set A are linearly ordered, and equipped with a constant time next 
function. 

The automaton can be viewed as the lexicographically ordered list of all triples 
(u, a, v) such that either a £ A and there is an a-labeled edge from state u to state 
v, or a E A and there is a a-labeled edge from state v to state u. Each entry of 
this list requires space 0{\ogi + logr) = 0(\og(£r)), and the list contains at most 
£ entries. In particular, a complete scan of the list takes time 0(£\og(£r)). 

The reduction of A consists in performing elementary reductions of type 1 as 
long as it is possible, and then elementary reductions of type 2. To find out whether 
a type 1 reduction is possible, one needs to scan the list to find two (consecutive) 
cells of the form (u,a,v) and (u,a,v'), requiring 0(^log(fr)). Performing the 
identification consists in reading through the list, replacing every occurrence of v' 
by v, and reordering the list: this takes time 0(£\og(£r)). To find out whether 
an elementary reduction of type 2 is possible, one scans the list to find a vertex u 
such that there is a unique triple of the form (u, a, v) in the list: again, this takes 
time 0{£\og(£r)). Finally, performing the reduction consists simply in deleting 
the entries (u, a, v) and (v, a, u) in the list. 

Summarizing, since we will perform at most £ reductions, we can take the 
function f(£,r) to be equal to £ 2 \og(£r). This yields the following statement. 

Theorem 3.8. There is an algorithm which, given an alphabet A of size r and 
reduced words hi, ■ ■ ■ , h n £ F{A) with total length £, decides whether the subgroup 
H generated by the hi is a free factor of F(A) in time 0{£ 2d+2 \og(£r)) 7 where 
d = \Aq \ — rank(iJ) and Aq is the set of letters in A that occur in the hi. 

3.3. Deciding whether H < f f K 

We now suppose that H and K are subgroups of F(A), given by sets of genera- 
tors with total length I, and we consider the problem of deciding whether H <ff K . 
The algorithm is the following: we first compute Ta(K) and we verify whether 
each generator of H can be read as a loop at go, the designated vertex of Ta(K). 
If not, then H is not contained in K, and cannot be a free factor. 

We now assume that H < K and we compute a spanning tree T of the graph 
Ta(K). This spanning tree determines a basis B of K, indexed by the edges of 
Ta(K) that are not in T (see Fact II. 8| ). and we express the given generators of 
H in terms of this basis: it suffices to read the generators of H as loops at the 
designated vertex in Ta(K), and to record the sequence of edges traversed and 
not in T . In particular, each generator of H is expressed as a reduced word in 
B* , that is no longer than its expression as a reduced word in A*. We then use 
this expression of the generators of H to construct T B (H), and the algorithm in 
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Section I3~2l to decide whether H is a free factor of the ambient free group, namely 
K. 

Let us now discuss the complexity of this algorithm, assuming that H and K are 
given by tuples of generators of total length £ and that F has rank r. As discussed 
in the previous section, computing Ta(K) takes time 0(£ 2 \og(£r)). Running a 
word w E F(A) in the automaton Ta(K) requires reading sequentially each letter 
of w (time O(logr)) and looking for the corresponding transition in the table 
representing the automaton (time 0(£\og(£r))). If w has length £', this takes time 
0(£'£\og(£r)). In order to decide whether H is contained in K, this must be done 
for every generator of H, and since the total length of these generators is at most 
£, this requires time 0(£ 2 \og(£r)). 

Assuming that H is indeed contained in K, the next step is to construct a 
spanning tree T of Ta(H), for instance by marking certain edges in the list rep- 
resenting Ta(K). Again, this can be done in time 0(£ 2 \og(£r)). The positively 
labeled edges of Ta{K ) not in T are in bijection with a basis B of K . In particular, 
the rank of K is at most £, an upper bound of the number of edges in Ta{K). The 
elements of B, seen as words in F(A), consist of the label of a path in T, followed 
by the label of an edge not in T, followed by the label of a path in T. In particular, 
their length is at most twice the number of vertices of Ya{K) plus one, that is 
0{£). But we do not need to compute these words: it suffices to number (from 1 
to rank(i^) < £) the positively labeled edges not in T. Then, for each generator h 
of H, reading h in Ta(K) from state go and keeping track of the (identifier of the) 
edges traversed that are not in T, provides an expression of h in B. Moreover, the 
total length of the expression of the generators of H in this basis of K is at most 
1. 

We now simply apply Theorem l3.8l to a length £ set of generators of a subgroup 
of a free group of rank at most £, to justify the following corollary. 

Corollary 3.9. Given tuples of generators for subgroups H and K of F(A), with 
total length £, one can decide whether the subgroup H is a free factor of K in time 
0{£ 2d+2 \og{£r)), where d = rank(K) - rank(iT). 

Remark 3.10. Corollary 13 . 21 suggests an alternative algorithm to decide whether 
H <ff K: one may explore the sequences of rank-incrementing i-steps of length at 
most d = rank(i^T) — rank(iJ), starting from Ta(H) and producing only represen- 
tations of subgroups contained in K. Each Ta{L) occurring in such a sequence 
must be checked to verify whether it can be embedded in Ta{K): if it can, then 
H <ff K, and if it cannot, then the automata produced by a rank-incrementing 
i-step from it must be computed and checked in their turn. 

3.4. Complement of a free factor 

By definition of a free factor (see the introduction), H <ff K if there exists a 
basis C of H and a disjoint set D C K such that C U D is a basis of K. In that 
case, the subgroup generated by D is called a complement of H in K . It is easy 
to see that this complement is not uniquely defined. 
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If H is indeed a free factor of the free group F, the algorithm described in 
Section [3. 21 also provides a sequence of rank- incrementing i-steps taking Ta(H) to 
Ta{F(Aq)), where A is the set of letters of A that occur in the generators of H. 
Repeated applications of Proposition 12.41 then yield a basis of F(A ) containing a 
basis of H, and hence a complement of H in F(A Q ). 

We can be a little more precise. If A is a reduced inverse automaton, let 
d\am qo (A), the q^-diameter of A, be the longest shortest path in A from qo to a 
state, that is 

diam go („4) = max ^min{|u| | u is a reduced word and qo — "^q in .4}^. 

Then Proposition 12.41 shows that if A — >\ B and L(A)p <ff L(B)p, then a com- 
plement of L(A)p is generated by a word of length at most 2 diam 9o (.4). 
This justifies the following statement. 

Proposition 3.11. Let H be a free factor of F{A). Then one can construct 
effectively a basis for a complement of H in F(A), consisting of words of length at 
most 2 diam gn (r^(ff)). 

It is interesting to observe that this is a graphical analogue - and a minor 
improvement - of Federer and Jonsson's result (see [12, Prop. 1.2.26]) mentioned 
in Section [1.31 Indeed, if hi,...,h n are a set of generators of H, then every 
edge of Ta(H) is on a loop at qo labeled by some hi, and hence max^ \hi\ > 
2diam 9o (r A (iO). 

Let us now consider the problem of constructing a basis of a complement of H 
in K, where H and K are given finitely generated subgroups of F(A) and H <ff K. 
The algorithm described in Remark l3.10l (based on Corollarv l3.2p shows that one 
can construct effectively a graphical free factor L of K such that H <ff L. As 
above, in view of Proposition 12.41 the same algorithm can produce a basis of a 
complement of H in L consisting of words of length at most 2 diam go (Ta(H)). 
There remains to construct a basis for a complement of L in K. 

In view of Fact 11.81 fand using the notation therein), one can find a (basis of a) 
complement of L in K by considering a spanning tree of r 'a(L), extending it to a 
spanning tree T of Ta{K), and considering the elements b e associated with edges 
of Ta{K) that are neither in Ta{L) nor in T. In particular, the words b e have 
length at most 1 + 2 diam 9o (T(K)). Thus we have the following statement. 

Proposition 3.12. Let H, K be finitely generated subgroups of F(A). Lf H <ff K, 
then one can construct effectively a basis for a complement of H in K , consisting 
of words of length at most max(2 diam ?0 (Ta(H)), 1 + 2 diam go (Ta{K))). 

The authors wish to thank both anonymous referees for their many helpful comments, 
which have substantially contributed to the improvement of the paper. All remaining 
shortcomings are naturally ours. 
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